ABSTRACT In spite of the tremendous increase
in the rate at which protein structures are
being determined, there is still an enormous gap 
between the numbers of known DNA-derived
sequences and the numbers of three-dimen- 
sional structures. In order to shed light on the 
biological functions of the molecules, researchers
often resort to comparative molecular modeling.
Earlier work has shown that when the
sequence alignment is in error, then the com- 
parative model is guaranteed to be wrong. In 
addition, loops, the sites of insertions and deletions
in families of homologous proteins, are exceedingly
difficult to model. Thus, many of the
current problems in comparative molecular 
modeling are minor versions of the global protein
folding problem. In order to assess objectively
the current state of comparative molecular
modeling, 13 groups submitted blind
predictions of seven different proteins of un- 
disclosed tertiary structure. This assessment 
shows that where sequence identity between the
target and the template structure is high (> 70%), 
comparative molecular modeling is highly suc- 
cessful. On the other hand, automated modeling 
techniques and sophisticated energy minimiza- 
tion methods fail to improve upon the starting 
structures when the sequence identity is low 
(~30%). Based on these results it appears that
insertions and deletions are still major problems.
Successfully deducing the correct sequence
alignment when the local similarity is
low is still difficult. We suggest some minimal
testing of submitted coordinates that should be 
required of authors before papers on comparative
molecular modeling are accepted for publication
in journals. © 1995 Wiley-Liss, Inc.
INTRODUCTION

Once a protein's sequence has been determined and 
it has been found to be a new member of a structur- 
ally characterized protein family, it is relatively 
straightforward to build a molecular model of the 
protein using a set of simple guidelines. Presently,
there are several commercial and public domain computer
programs that have been developed for modeling;
these programs remove much of the tedium
from the process. There are numerous reasons for 
constructing comparative molecular models of pro- 
teins. The molecular model may explain the struc- 
tural basis of existing experimental results and can 
provide one with structural information on which 
further experiments can be planned, executed, and 
evaluated. Site-specific mutations of the gene coding 
for the specific protein can provide important data 
regarding the protein's function. Perhaps some of
the most revealing experiments are those designed to 
predict and to probe the molecular reasons for an 
enzyme's specificity. 3 On a more practical note, the 
molecular model can sometimes be used successfully 
to determine phases for a crystal structure determi- 
nation using the method of molecular replacement. 
The more spectacular uses, however, are typified by 
the recent successful application of comparative mo- 
lecular modeling for identifying new classes of lead 
compounds in antimalarial drug development. 

An example of the successful prediction of an enzyme's
specificity from comparative molecular modeling
is that for granzyme B (CCP1), a serine proteinase
from cytotoxic T lymphocytes. 3 A molecular
model of CCP1 (48% identical to rat mast cell proteinase II)
showed that an arginine at position 226
would occupy the specificity pocket, thereby suggesting
a P1 specificity for an aspartate or glutamate
residue. Subsequent synthesis and testing of a
series of substrates differing in the nature of the P1
residue confirmed the aspartate specificity of
CCP1. 6 The P1 specificity of CCP1 has recently been
altered by site-specific mutagenesis of the residue at 



*This assessment does not indicate that any one particular
modeling group or modeling technique is superior to any other.
We do not believe that comparative molecular models can be
ranked using a single or even several numeric indicators, as

based upon the results herein are not justifiable, in our opinion.